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We study structural properties of growing networks where both addition and deletion of nodes are 
possible. Our model network evolves via two independent processes. With rate r, a node is added 
to the system and this node links to a randomly selected existing node. With rate 1, a randomly 
selected node is deleted, and its parent node inherits the links of its immediate descendants. We 
show that the in-component size distribution decays algebraically, k ^ oo. The 

exponent /3 = 2 + (r — varies continuously with the addition rate r. Structural properties of 
the network including the height distribution, the diameter of the network, the average distance 
between two nodes, and the fraction of dangling nodes are also obtained analytically. Interestingly, 
the deletion process leads to a giant hub, a single node with a macroscopic degree whereas all other 
nodes have a microscopic degree. 

PACS numbers: 89.75,Hc, 05.40.-a,05.20.Dd, 02.50.Ey 
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I. INTRODUCTION 

Idealized models of random networks provide an im- 
portant tool for characterizing real-world networks such 
as communication, infrastructure, and social networks 
[11, 0, 13] ■ For example, the emergence of hubs in hetero- 
geneous networks is a consequence of a "rich-gets-richer" 
mechanism. This phenomenon arises in networks that 
grow via preferential attachment [1, [E, Q . 

Most network growth models are based upon sequen- 
tial addition of nodes and subsequent attachment to ex- 
isting nodes according to a prescribed mechanism. This 
framework does not allow for removal of nodes. In re- 
ality, there are two types of networks. One type of net- 
works including for example citation networks may only 
expand, while another type of networks such as friend- 
ship networks may either expand or contract. Indeed, 
individuals may join or leave a social group 0, Q. The 
same is true for technological networks such as the world- 
wide-web because websites may disappear. 

There is substantial theoretical understanding of the 
structure of strictly expanding networks, but much less 
is known about networks that may also contract 0, H, H, 
In the latter case, it is typically impossible 
to construct closed equations for quantities such as the 
degree distribution because removal of nodes generates 
memory or correlations. 

In this paper, we introduce a simple network growth 
process with addition and deletion of nodes and show 
that it is possible to obtain closed equations for several 
structural properties of the network. In our growing net- 
work model, nodes may be added or deleted. When a 
node is added to the network, it is attached to a ran- 
domly selected existing node. When a node is deleted, its 
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daughters are attached to its parent. This process is rel- 
evant for phylogenetic trees, namely trees that document 
the ancestries of distinct biological species [13, [H, [3 
because in the process of evolution, mutation events gen- 
erate new nodes while extinction events result in deletion 
of existing nodes [17, 18, 19, 20, 2^. 



We first investigate the in-component size distribu- 
tion and show that this quantity obeys a closed evolu- 
tion equation. We analytically obtain many characteris- 
tics of this distribution including the generating function, 
the moments, and the tail behavior. The in-component 
size distribution decays algebraically and the character- 
istic exponent varies continuously with the addition rate. 
Such behavior is in contrast with preferential attachment 
networks where this exponent is fixed. 



Next, we obtain closed equations for the height dis- 
tribution that characterizes the distance between a node 
and the root. This distribution obeys Poisson statistics. 
Other structural characteristics such as the diameter of 
the network and the degree of the most connected node 
follow directly from this quantity. In general, deletion 
leads to a condensation phenomena where a single node 
is connected to a finite fraction of all nodes, while the 
rest of the nodes have only a finite degree. 



The rest of this paper is organized as follows. The 
addition-deletion process is introduced in section II. The 
in-component size distribution and related quantities 
such as the fraction of dangling nodes are discussed in 
section III. Next, in section IV, the height distribution 
and related quantities including the diameter are de- 
scribed. The degree distribution is analyzed using an 
approximate theory and numerical simulations in section 
V. Section VI contains a brief summary. Finally, appen- 
dices El and [B] contain several technical derivations. 
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FIG. 1: Illustration of the node addition process. 




FIG. 2: Illustration of the node deletion process. 



II. THE ADDITION-DELETION PROCESS 

In the addition-deletion process, initially there is one 
seed node, and then, the network grows via addition of 
nodes and shrinks via deletion of nodes as follows. 

1. The addition process. With rate r, a new node 
is added and it links to a randomly selected node 
(Fig. 1). This is an egalitarian attachment process 
as the new node is equally likely to link to any one 
of the existing nodes. 

2. The deletion process. With rate 1, a randomly 
selected node (together with its outgoing link) is 
deleted. The parent of the deleted node inherits 
all the incoming links of the deleted node (Fig. 2). 
This inheritance mechanism preserves ancestral re- 
lations as is the case in phylogenetic trees: when a 
species goes extinct, its immediate descendants are 
linked to its immediate ancestor. 

We stress that the addition process and the deletion pro- 
cess are completely independent of each other and that in 
the deletion process all links are removed with an equal 
probability. 

This addition-deletion process preserves the complex- 
ity of the graph since nodes and links are added and 
deleted in pairs. Thus, starting with a single seed node 
and no links, the network has a tree structure. The seed 
node can not be deleted (or the ancestry would be de- 
stroyed) and therefore, the seed is the root of the tree. 

The average total number of nodes at time t, N{t), 
evolves according to dN/dt — (r — 1). Therefore, this 
quantity grows linearly with time and since A^(0) = 1, 

N{t)^l + (r-l)t. (1) 

The total number of links, L{t), follows from L — N — 1 
and therefore, L{t) — {r — l)t. We restrict our attention 
to the case of growing networks, r > 1. In this case, the 
size of the network increases indefinitely with time and 



fluctuations are negligible in the long time limit. The 
total number of nodes grows diffusively and is subject to 
large fluctuations in the limit r 1. 

III. THE IN-COMPONENT SIZE 
DISTRIBUTION 

The in-component of a node n is the set of all nodes 
which are connected to n via a path of directed links. In 
the context of a phylogenetic tree, the in-component is 
simply the set of all descendants. Let Ck be the aver- 
age number of nodes with an in-component of size k. In 
our definition, the in-component includes the node itself, 
and therefore fc > 1. The in-component size distribution 
evolves according to 

dCi. r k 

+ ^ [ik-l)Ck-i-kCk] + j [Ck+i-Ck] . (2) 

The first three terms correspond to changes caused by 
node addition. New nodes generate a new in-component 
of size one and thus, the first term. In-components grow 
by one when a node links to any of the nodes in that in- 
component and the next two terms account for this aug- 
mentation. The corresponding attachment rate is the ad- 
dition rate r normalized by the total number of links N. 
The last two terms correspond to node deletion. Deletion 
of any one of the k links in an in-component of size k + 1 
generates an in-component of size k and hence the gain 
term. Similarly, deletion of any one of the k nodes in an 
in-component of size k decreases the in-component size 
by one and hence the loss term. All nodes except for the 
root may be deleted and accordingly, the deletion rate 
is normalized by the total number of links L. By sum- 
ming equations ^ one can verify that the total number 
of nodes = 1 -|- J2k>i satisfies dN/dt — r — 1. 

We are interested in the long time asymptotics where 
the difference between the total number of nodes and the 
total number of links is negligible. Therefore, we replace 
L with iV in ([2]). We also make the transformation 

Ck = Nck (3) 

where is the fraction of nodes with in-component de- 
gree equal to k and X]fe>i Cfc = 1- The distribution fmic- 
tion Ck satisfies the difference equation 

(r-l)c/c=r [5kA + {k-l)ck-i-kck\ + k{ck+i-Ck). (4) 

The tail of this distribution can be obtained using a con- 
tinuum approximation, that is, through replacement of 
differences with derivatives: Ck+i — Ck dc/dk, etc. 
This change transforms the difference equation (U) into 
the differential equation 

(IK QjK 

The solution of this equi-dimensional equation is a power- 
law 

Ck^Bk-f", /3 = 2 + ^, (6) 



as A; ^ oo. Thus, the in-component size distribution 
has a power-law tail. The exponent 2 < /3 < oo varies 
continuously with the addition rate. The characteristic 
exponent is minimal, /? = 2, for the well-understood case 
of random recursive trees (r—^ oo) where deletion is ir- 
relevant [Ullllii,!!!,!!!,!!!. The exponent diverges in 
the more interesting limit of marginally growing networks 
(r 1) where deletion is as strong as addition. 

Addition-deletion networks differ from strictly grow- 
ing networks where the exponent characterizing the in- 
component size distribution is independent of the details 
of the attachment mechanism, (3 = 2 [2^ . Thus the dele- 
tion process qualitatively changes the network structure. 

The moments M„ = k^Ck provide complementary 
information about the in-component size distribution. Of 
course, Mq = — 1. The first moment satisfies 



dMi 

Iln 
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(7) 



as seen by summing ([2]). Equation ([7]) describes the evo- 
lution as a function of the total number of nodes rather 
than time. Since we are interested in the asymptotic 
behavior, we again replace L with N m Solving 
Eq. ([7]), we find that the first moment is proportional 
to the number of nodes, Mi = (1 -I- r)N . Hence the av- 
erage in-component size is linear in the addition rate, 
(fc) = 1 -I- r. 

Similarly, the second moment obeys 



dM2 

Iln 



2r 
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3 M2 
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There are three types of asymptotic behavior 



(l!±5r±2)^ 
2 — r 

16NlnN 



1 < r < 2; 
r = 2; 
r > 2. 



(8) 



(9) 



The proportionality constant n depends on the initial 
conditions. These asymptotic behaviors are consistent 
with the algebraic decay (O: the second moment grows 
linearly when /3 > 3, it increases super-linearly when 
/3 < 3, and it acquires a logarithmic correction in the 
marginal case (3 = 3. 
The generating function 



F{x)=J2 



CkX 



(10) 



fc=i 



contains complete information about the in-component 
size distribution. Normalization implies F{1) = 1 and 
since cq = 0, then F(0) = 0. Multiplying Eq. (g]) by x'' 
and summing over fc, we arrive at a first order differential 
equation for the generating function 

{x-l){rx-l)F'{x) + (l - r - x^^) F{x) + rx = (11) 

where the prime denotes differentiation with respect 
to X. The homogeneous part of equation (fTT|) ad- 



mits the solution x{l 



1 



We make 




0.4 0.6 
1/r 



FIG. 3: The fraction of dangling nodes ci versus inverse of 
the addition rate 1/r (solid line). Also shown is the large-r 
asymptotic behavior ci = | + (dashed line). 



the transformation F{x) = x{l — x)^~^{rx — a{x) 
when 1/r < x < 1, and then the auxiliary function a{x) 
satisfies a' = r (1 — x)^^{rx — 1)^^^. Integration of this 
equation yields the generating function 



F{x) 



dy 



[ry - lf~ 



[rx-iy J I/, " (l-y)/3 



(12) 



in the range 1/r < x < 1. The lower limit of integra- 
tion was chosen to assure that F{l/r) is finite. One can 
verify that lim^;^! F{x) = 1 in agreement with the nor- 
malization requirement. A similar calculation gives the 
generating function in the range < a; < 1/r 

x(l-x)l^-^ , (l-ry)!^-^ , , 

F{x) = r^^ '-— dy^—^i^. (13) 



(1 - rx)f' 



The large-fc behavior is encoded in the a; — > 1 behavior 
of the generating function. First, we confirm the power- 
law tail ^ by using p^ . Second, we obtain the corre- 
sponding proportionality constant 



i? = r(/3)(/3-i) 



(14) 



The appendix details the analysis leading to this result. 

Conversely, the small-fc behavior is refiected by the 
X — *■ behavior of the generating function. The fraction 
of dangling nodes, that is, nodes with no incoming links, 
is given by ci = lhiix~,oF'{x) and by differentiation of 
the generating function we readily find 



Cl 



dy{l-yf-\l~y/ry 



(15) 



The integral can be expressed via the hypergeometric 
function, ci = (3~^F[(3, 1; /3 + 1; r~^), or alternatively as 
the power series ci = 

Efc>o(^ + The quantity 

Cl is a monotonically decreasing function of the addition 
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rate r and is roughly linear in 1/r (figure [3]). It is max- 
imal for marginally growing networks (see subsection A 
below) , 



lim ci = e £^1 (1) = 0.596347 . . . , 



(16) 



where Ei{x) = dzz'^e''^ is the exponential integral 
[28t . The fraction of dangling nodes is minimal for ran- 
dom recursive trees, limr^oo ci = 1/2. In general, over 
half of the nodes are dangling nodes. 

The in-component size distribution can be cal- 
culated recursively for small k by using ([4]), 
C2 = r(2ci - 1), C3 = r [(6r + l)ci - (3r + 1)], etc. 

We now investigate the limits r — s- 1 and r — > oo in 
detail. 



A. Marginally Growing Networks 

The recursion relation (jj]) becomes 

Cfe+i = 4,1 + (fc - l)cfc-i - 2kck + (fc + l)cfe+i (17) 

for marginally growing networks (r — > 1). Repeating 
the steps leading to ([5]) yields the second order linear 
differential equation 



(18) 



The tail of the in-component size distribution has a 
stretched exponential tail 



Cfc 



(19) 



a result that can be obtained using the WKB technique. 
This sharp decay is consistent with the divergent (3. 

To find the generating function, we repeat the steps 
leading to ([T2|) . The governing equation (|lip becomes 
{x— 1)"^ F' — F + X — 0, and the solution of this equa- 
tion is F{x) = J^^'' ^ exp ( - i). The gener- 
ating function can be expressed in terms of the exponen- 
tial integral 



F{x) 



■e~Ei 



1 



(20) 



1 — a; "\^1 — a: 

from which the fraction of dangling nodes (fT6|) follows. 

B. Random Recursive Trees 

The second limiting case, r — > cxi, corresponds to ran- 
dom recursive trees. Now, the in-component size distri- 
bution satisfies the recursion equation 



{k + l)cfe = (fc - l)cfe-i -I- 4,1- 



(21) 



Unlike in the general case, this simpler equation can be 
solved recursively starting at fc = 1: ci = j^, C2 = 



C3 



gij, etc. We thus recover the well-known result |26 



Cfe 



k{k + l) 



(22) 



Alternatively, one can obtain the distribution from the 
generating function F{x) = 1 + ln(l — x). 



IV. THE HEIGHT DISTRIBUTION 

Another important structural characteristic is the 
height of a node, that is, the number of links separat- 
ing a given node from the root [2^, [s^l . Let Hk (t) be the 
average number of nodes with height k at time t. This 
quantity evolves according to 



dHk 
dt 



N 



fc-i 



fe+i 



(23) 



for k > 1. The initial condition is -fffe(O) = 5k.o and 
the boundary conditions are Hq — 1 and H-i = 0. The 
first term on the right-hand side accounts for gain due to 
node addition and the next two terms describe changes 
due to node deletion. The factor k reflects that deletion 
of a node of height k reduces the heights of all of its 
descendants by one. Again, the addition and the dele- 
tion rates are normalized by the total number of nodes 
and the total number of links, respectively. One can ver- 
ify that the total number of nodes N = J^k satisfies 
dN/dt = r - 1. 

Following the above analysis of the in-component size 
distribution, we study the long time asymptotic behav- 
ior of hk, the fraction of nodes with height k, using the 
transformation 



Hk = Nhk 



(24) 



Normalization implies hk = I and the boundary con- 
dition is now ho = since Hq = 1 . Let us substitute ((24l) 
into the rate equation (|23p . The distribution function hk 
satisfies the recursion relation 

r{hk - hk-i) + (hk+i - hk) ^ {k + l)hk+i - khk- (25) 

This equation was obtained by adding Hk+i to both side 
of (ESI). 

In this case, it is convenient to use the exponential 
generating function, defined via 



oo 

G(z) = hke"' 

k=0 



(26) 



Multiplication of the recursion relation (I^S)) by e*^^ and 
summation over k leads to the first order ordinary differ- 
ential equation 



dG 

dz 



G(z)(l + re"). 



(27) 
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The generating function is found by integration of this 
equation subject to the boundary condition G(0) — 1, 



G{z. 



(28) 



Using the definition (|26p we find that the height distri- 
bution function is Poissonian 



„fe-i 



(fc-l)!' 



(29) 



The moments of the height distribution 
m„ = 'Ylik ^"^^k can be found by differentiation 
of the generating function ([28)1 . For instance, 
TOi — G"(0) = 1 + r, and therefore the average 
node height is equal to the average in-component size, 
(fc) = 1 + r. Similarly, fluctuations in the height are ob- 
tained from the second moment m2 
and hence, the variance is 



c"(0) = r2 + 3r+l 



(30) 



This variance is sufficient to characterize the behavior in 
the vicinity of the average, where the Poissonian height 
distribution becomes Gaussian. 

We can immediately deduce a number of characteris- 
tics using the height distribution. There are nodes 
at height k with incoming links from the H^+i nodes at 
height /c + 1. Thus, the average in-degree Ik of nodes at 
height k IS Ik ^ Hk+i/Hk- Using Hq = 1 and ((29|) we 
find 



Ik = 



A^e-'' /c = 0; 
rk^^ k > 0. 



(31) 



The second result shows that the average degree is in- 
versely proportional to the height. 

The root, with its macroscopic degree, is the most con- 
nected node whereas all other nodes have a microscopic 
degree. Hence, the root is a giant hub. The degree of 
the root is maximal for marginally growing networks, 
limr^i Iq = Ne~^, and it decreases indefinitely as r in- 
creases. Therefore, deletion must be responsible for this 
condensation phenomena. Indeed, since the attachment 
rate to each node is 1/N, addition can be responsible for 
at most In N of the Ne~'^ connections as is the case for 
random recursive trees 0, [2^ HI]. We conclude that 
emergence of the giant hub is due to deletion. 

Consider now the highest node. The maximal height 
kmax is determined from the extreme statistics criterion 
^ 1- Using the Poisson distribution (|29|) and the 
Stirling formula fc! ^ (fe/e)'', the highest node grows very 
slowly with the network size, /Cmax ~ \nN/\n{lnN). Re- 
markably, the maximal height is independent of the ad- 
dition rate r. 

Asymptotically, the diameter of the network D is twice 
the maximal height 



D 



IniV 
ln(ln N) ' 



(32) 



For strictly growing random trees, the diameter exhibits a 
logarithmic growth independent of the attachment mech- 
anism, D ^ \nN [2^. However, the diameter can be 
affected by the topology. If every new node links to m 
existing nodes, this strictly growing network is no longer 
a tree (for to > 2) and D ^InN/ InlniV. This result jHH 
is again very robust, namely it holds for strictly growing 
networks with reasonable attachment mechanisms and 
for all TO > 2. Equation ([32]) shows that deletion qualita- 
tively affects the growth of the diameter. Surprisingly, 
despite their distinct topologies these strictly growing 
networks and the addition-deletion networks have sim- 
ilar diameters. 

Finally, we address the average distance between two 
nodes, (l). Since the degree of the root is proportional 
to the total number of nodes N , a path connecting two 
randomly selected nodes almost surely includes the root. 
As a result, the average distance is asymptotically equal 
to twice the average height (Z) = 2(1 + r). 



V. THE DEGREE DISTRIBUTION 

In contrast with the in-component size distribution 
and the height distribution, it is impossible to construct 
closed equations for the degree distribution and we per- 
form an approximate analysis. 

When a node with in-degree j is deleted, the in-degree 
of the parent i is augmented by j — 1. This is simply the 
aggregation process |32|] 



[hjj > l + J 



1. 



(33) 



The aggregation rate is proportional to the in-degree of 
the parent node because any of the daughter nodes can 
be deleted. The normalization factor NL reflects that 
the probability of picking the deleted node is inversely 
proportional to the number of nodes and the probability 
of picking the parent node is inversely proportional to 
the number of links. 

Let Ak{t) be the average number of nodes with in- 
degree k at time t (the out-degree is always one). The 
total number of nodes and the total number of links are 
given by = J2k ^fc ^^'^ ^ — Y^k k^k, respectively. As- 
suming that the degrees of neighboring nodes are com- 
pletely uncorrelated, the quantity Ak obeys the nonlinear 
rate equation [3I, [SJ] 



— rdk n ^ [Ak-i ^ Ak] 



(34) 



1 



^ lAiAj - [kN + L)Ak 

i+]=k+i 



The initial condition is Ak{Q) — Skfi and the bound- 
ary condition is A_i{t) = 0. The first three terms 
represent addition events. These terms are linear and 
thus, exact. The last three terms account for deletion 
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FIG. 4: The degree distribution for the aggregation process 
and for the addition-deletion network. Shown are results of 
one Monte Carlo simulation run with 10^ nodes and r — 4. 
Roughly 90% of the nodes have a degree smaller or than 
k = 10.' 
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FIG. 5: The tail of the degree distribution for the addition- 
deletion network. The simulation results were obtained from 
10'^ independent realizations in a network with 10^ nodes and 
r = 4. The decay exponent is 7 = 2.45 ± 0.05 



events. These terms are non-linear as they represent 
two node interactions. Moreover, the gain term has a 
convolution structure, reflecting the aggregation process 
(|33| . Of course, the aggregation process becomes ex- 
act in the limit r — > 00 where deletion becomes irrele- 
vant. By summation of the rate equation (|34p we recover 
dN/dt = dL/dt = r - 1. 

Following our above analysis, we study the fraction of 
nodes with in-degree k, a^. Making the transformation 
Ak = Nak, the degree distribution satisfies 

(fc -t- 2r)ak = r5k,o + ra^^i -I- ^ zajOj (35) 

We analyze this hierarchy of equations using the gen- 
erating function U{x) = X^fe^fc-^*^- The normalizations 
Efe«fe = Efc kau = 1 imply C/(l) = [/'(!) = 1. By mul- 
tiplying the recursive equations (|35p by and summing 
over fc, we find that the generating function obeys the 
nonlinear ordinary differential equation 

{x - U)'^ ^ r{x - 2)U + r. (36) 
dx 

This equation is consistent with the normalization con- 
ditions U{1) = U'{1) = 1. 

The moments of the degree distribution defined by 
rrin = Efe Ofc/c" are obtained by successive differentiation 
of the generating function. In particular, the high-order 
moments grow rapidly although they do remain finite 
(appendix B) 

-1 n 

™" --[(731)2] (37) 

as n — > 00. Using these asymptotics we deduce (see Ap- 
pendix B) that the tail of the degree-distribution decays 
as a stretched exponential 

flfe ^ e-^^ (38) 



with A = [2(r - 

How well does this approximation compare with 
addition-deletion networks? In addition-deletion net- 
works, the root has a macroscopic degree. In other words, 
it is a condensate that contains a finite fraction of all 
nodes. In the aggregation process ([55]) all clusters are 
finite, as seen from U'{1) = "^i^kak = 1 [11]. This is 
a consequence of the fact that all nodes are treated in 
the same way. Introducing a fixed node that can not be 
deleted should significantly improve the approximation. 

We also examined the degree distribution using Monte 
Carlo simulations of the aggregation process ([55)1 and the 
addition-deletion process. We find that when the degree 
is very small, the aggregation process approximates the 
addition-deletion process very well (Figure [4]) . For small 
degree this agreement holds independent of r. However, 
this approximation is poor when the degree is large. 

For addition-deletion networks, the degree distribution 
follows a power law behavior (Figure [5]) 

au k--^ (39) 

as fc — > 00. Hence, degree-degree correlations are negli- 
gible for small degrees but substantial for large degrees. 
Indeed, highly connected nodes are long lived and thus, 
involve strong memory. 

Addition-deletion network can be simulated using an 
efhcient algorithm that is linear in the network size. 
Throughout the growth process we keep track of all 
nodes, whether they have been deleted (passive nodes) 
or not (active nodes). In an addition event, an active 
node is added and it links to a randomly selected ac- 
tive node. In a deletion event, an active node is marked 
passive. When the required network size is reached, pas- 
sive nodes are removed in reverse chronological order to 
preserve the ancestral relations. 

Simulations of the aggregation process are performed 
as follows. The system consists of aggregates, each with a 
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given size. In an addition step, an aggregate of size zero is 
added and simultaneously, the size of another, randomly 
chosen aggregate increases by one. In a deletion step one 
aggregate is chosen with probability proportional to its 
size and it merges with another randomly chosen aggre- 
gate according to ([33]) . 

VI. SUMMARY 

We have studied networks which undergo a biased 
growth — new nodes are added at a certain rate and old 
nodes are deleted at a smaller rate in such a way that 
the ancestry is preserved. This growing network model 
is relevant to biological and technological networks that 
preserve ancestral relations between nodes. Our main 
conclusion is that it is possible to characterize the struc- 
ture of such networks analytically. For instance, the in- 
component size distribution has a power-law tail and the 
characteristic exponent varies continuously with the ad- 
dition rate. We have also shown that the height dis- 



tribution obeys Poisson statistics. Additional properties 
including the fraction of dangling nodes and the average 
degree as a function of height have been obtained as well. 

Interestingly, the addition-deletion process results in 
a giant hub that is connected to a finite fraction of the 
nodes in the system. This fraction decreases to zero as 
deletion becomes weaker, showing that the deletion pro- 
cess causes this condensation phenomena. 

We have also seen that the degree distribution evolves 
by an aggregation process since the parent node inherits 
all incoming links of a deleted node. Treating the under- 
lying aggregation process as completely random is valid 
only at small degrees because degree-degree correlations 
become significant at large degrees. Constructing closed 
equations for this aggregation process from which the 
degree distribution can be calculated is an outstanding 
challenge. 
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APPENDIX A: DERIVATION EQ. (fT4l) 

The tail of the in-component size distribution can be 
obtained from the x — > 1 asymptotics of the generating 
function F(x). Let us consider for example the power- 
law tail Cfc ~ Bk~^ as fc cx). In this case, the leading 
behavior of the derivative of the generating function is 



dF{x) 
dx 



= kckx''-^ ^bJ2 k^^^^ = -B ln(l 



in the limit x ^ 1. Generally, the power- law decay ^ 
implies 



df^-^F{x 
dx0-^ 



^Bln(l - x) 



(Al) 



as a; ^ 1, for any positive integer /3. 

To evaluate this derivative, we rewrite the generating 
function p2|) using the transformation z ~ (/3 — 1)(1 — y) 



A/3-1 



/ N R X(l — X) 

F{x)=r^^ 



(1 - 

dz^- rl . (A2) 



For arbitrary positive integer /?, the integral is evaluated 
by performing f3 successive integration by parts 



F{x)-- 



' {rx - \f 



A/3-1 



[rx - \Y 



ln(l-a;) (A3) 



where Pnix) is a polynomial of degree n. 

Differentiating the generating function (|A3[) /?— 1 times 
and noting that the second term dominates in the x — > 1 
limit, we obtain 



d/^-^Fjx) 
dx0-^ 



-(/3-1)! 



ln(l - x). (A4) 



By comparing this expression with (|A1[) we obtain the co- 
efficient B = r(/3)(/3 - 1)'^ for aU integer /? > 2. Finally, 
we use analytic continuation and extend this result to 
arbitrary (3. 



APPENDIX B: DERIVATION OF ^ AND (fM]) 

We obtain the large-A: behavior of the degree distribu- 
tion using the .t — > 1 behavior of the generating func- 



tion U (x) . Differentiating equation p6|) n times and set- 
ting a; = 1 gives a recursion relation for the derivatives 
L/„ ^d"{/(a;)/da;"|,=i 



r n{n + r) 



+ r) 1 fn\ 



The first two coefficients are Uq ~ Ui = 1. Only the first 
term in the recursion equation is relevant asymptotically 
and therefore. 



■ n 



(Bl) 



as n oo. Eq. (|37p is obtained by noting that the 
coefficients 



Un^Yk{k-l)...{k-n + l) 



k>0 



are asymptotically equivalent to the moments, [/„ 
The large-rt behavior of r7i„ is computed as follows: 



dk ¥^ak 
ldke.,{nlnk-XV-k 



exp 

2 
A 



2r7,ln(2n/Ae) 

2n 

(nlf. 



(B2) 



In the first line we replaced the summation by in- 
tegration, and in the second we used the presumed 
asymptotic ~ e~'^^. The third line was obtained 
by using the steepest descent method — the func- 
tion /(fc) = nln/c — XVk is maximal at k* = {2n/\Y. 
The fourth line was obtained using the Stirling formula 
n\ ^ (njef'' . Finally, by comparing equations (jB2p and 
([37)) . we obtain the tail behavior ((38|) [3a. 



